Structural Analysis of Fault-Tolerance for Homogeneous Systems
نویسندگان
چکیده
منابع مشابه
Fault Tolerance for Digital Systems
Fault tolerance is an essential methodology for digital systems, particularly for those that serve applications where failure has safety implications or where interruption of operations imposes serious financial penalties. There is no single fault tolerance technique that suits or is optimal in all circumstances. A taxonomy of fault tolerance techniques is presented and branches and leaves of t...
متن کاملAdaptable Fault Tolerance for Real-Time Systems
This paper proposes a framework for software-implemented, adaptive fault tolerance in a real-time context. It extends previous work in two main ways: by including features that explicitly address the realtime constraints; and by a flexible and adaptable control strategy for managing redundancy within application software modules. This redundancy-management design is introduced as an intermediat...
متن کاملAdaptable Fault Tolerance Configurations for Multiprocessor Systems
The escalating increase in the complexity of multiprocessor systems increases the probability of faults occurring in these systems As a consequence there is a great need for achieving fault-tolerance of processing in multiprocessor systems. Faulttolerance generally requires some forms of hardware and/or time redundancy. Two fault tolerant configurations are proposed for both single and double t...
متن کاملIntrospective Fault Tolerance for Exascale Systems∗
Faults and errors are an unavoidable aspect of high performance computing systems. Emerging exascale systems will contain billions of hardware components and complex software stacks. In addition, higher fabrication density and power challenges will further compound fault detection, management and recovery. Efficient fault tolerance and resiliency frameworks are thus of immense importance in the...
متن کاملLayered Fault Tolerance for Distributed Embedded Systems
This thesis deals with principles and techniques of fault tolerance for distributed embedded systems. A layered approach is taken to achieve high dependability by structuring error detection and recovery mechanisms into three layers. The first layer consists of mechanisms implemented in hardware, either at the circuit or the micro-architectural level. Many integrated circuits, especially microp...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Transactions of the Society of Instrument and Control Engineers
سال: 1997
ISSN: 0453-4654
DOI: 10.9746/sicetr1965.33.441